NSF PAR Search | NSF Public Access Repository

Stochastic Optimization and Learning for Two-Stage Supplier Problems

https://doi.org/10.1145/3604619

Brubach, Brian; Grammel, Nathaniel; Harris, David G; Srinivasan, Aravind; Tsepenekas, Leonidas; Vullikanti, Anil (March 2025, ACM Transactions on Probabilistic Machine Learning)

The main focus of this article is radius-based (supplier) clustering in the two-stage stochastic setting with recourse, where the inherent stochasticity of the model comes in the form of a budget constraint. In addition to the standard (homogeneous) setting where all clients must be within a distance \(R\) of the nearest facility, we provide results for the more general problem where the radius demands may be inhomogeneous (i.e., different for each client). We also explore a number of variants where additional constraints are imposed on the first-stage decisions, specifically matroid and multi-knapsack constraints, and provide results for these settings. We derive results for the most general distributional setting, where there is only black-box access to the underlying distribution. To accomplish this, we first develop algorithms for the polynomial scenarios setting; we then employ a novel scenario-discarding variant of the standard Sample Average Approximation method, which crucially exploits properties of the restricted-case algorithms. We note that the scenario-discarding modification to the SAA method is necessary to optimize over the radius.
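The difference between homogeneous and inhomogeneous radius demands can be illustrated with a small feasibility check. This is only a sketch under stated assumptions, not the article's algorithm: it assumes points in the Euclidean plane, and the function name `covers_all_clients` and its arguments are hypothetical.

```python
import math

def covers_all_clients(clients, facilities, radius):
    """Return True if every client is within its own radius of some open facility.

    clients:    dict mapping a client name to its (x, y) position (assumed Euclidean)
    facilities: list of (x, y) positions of open facilities
    radius:     dict mapping a client name to its radius demand R_j
    """
    if not facilities:
        # With no open facility, only an empty client set is covered.
        return not clients
    for name, point in clients.items():
        nearest = min(math.dist(point, f) for f in facilities)
        if nearest > radius[name]:
            return False
    return True

clients = {"a": (0.0, 0.0), "b": (3.0, 4.0)}
facilities = [(0.0, 0.0)]

# Homogeneous setting: the same radius R for every client.
homogeneous = {name: 5.0 for name in clients}
# Inhomogeneous setting: client "b" demands a tighter radius.
inhomogeneous = {"a": 5.0, "b": 4.0}

print(covers_all_clients(clients, facilities, homogeneous))
print(covers_all_clients(clients, facilities, inhomogeneous))
```

In the homogeneous case a single value of R is passed for all clients; the inhomogeneous case only changes the per-client entries in `radius`.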
Free, publicly-accessible full text available March 31, 2026
Online Matching Frameworks Under Stochastic Rewards, Product Ranking, and Unknown Patience

https://doi.org/10.1287/opre.2021.0371

Brubach, Brian; Grammel, Nathaniel; Ma, Will; Srinivasan, Aravind (October 2023, Operations Research)

In e-commerce, customers have an unknown patience in terms of how far down the page they are willing to scroll. In light of this, how should products be ranked? The e-commerce retailer’s problem is further complicated by the fact that the supply of each product may be limited, and that multiple customers who are interested in these products will arrive over time. In “Online Matching Frameworks Under Stochastic Rewards, Product Ranking, and Unknown Patience,” Brubach, Grammel, Ma, and Srinivasan provide a general framework for studying this complicated problem that decouples the product ranking problem for a single customer from the online matching of products to multiple customers over time. They also develop a better algorithm for the single-customer product ranking problem under well-studied cascade-click models. Finally, they introduce a model where the products are also arriving over time and cannot be included in the search rankings until they arrive.
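To make the ranking question concrete, the following sketch evaluates the expected reward of one ranking for a single customer under a simple cascade-style click model. The model details are assumptions for illustration rather than the paper's exact formulation: the customer views positions top to bottom, `view_prob[k]` is the assumed probability that their patience reaches position `k`, and they stop at the first click. The name `expected_reward` is hypothetical.

```python
def expected_reward(ranking, view_prob):
    """Expected reward of a ranking for a single customer.

    ranking:   list of (click_probability, reward) pairs, top position first
    view_prob: view_prob[k] is the probability that the customer's patience
               reaches position k (assumed non-increasing in k)
    """
    total = 0.0
    no_click_so_far = 1.0  # probability that no earlier item was clicked
    for (click_prob, reward), reach in zip(ranking, view_prob):
        total += reach * no_click_so_far * click_prob * reward
        no_click_so_far *= 1.0 - click_prob
    return total

view_prob = [1.0, 0.5]  # every customer sees position 0, half also see position 1
cheap_first = [(0.5, 10.0), (0.5, 20.0)]
expensive_first = [(0.5, 20.0), (0.5, 10.0)]

print(expected_reward(cheap_first, view_prob))
print(expected_reward(expensive_first, view_prob))
```

Comparing the two orderings shows how limited patience changes which product should be placed first, even when click probabilities are equal.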
Full Text Available
Fair Labeled Clustering

https://doi.org/10.1145/3534678.3539451

Esmaeili, Seyed A.; Duppala, Sharmila; Dickerson, John P.; Brubach, Brian (August 2022, KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Fair Clustering Under a Bounded Cost

Esmaeili, Seyed A.; Brubach, Brian; Srinivasan, Aravind; Dickerson, John P. (December 2021, Proc. Conference on Neural Information Processing Systems (NeurIPS))

Clustering is a fundamental unsupervised learning problem where a data-set is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the “price of fairness,” can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective which generalizes the group egalitarian objective. We derive fundamental lower bounds on the approximation of the utilitarian and egalitarian objectives and introduce algorithms with provable guarantees for them. For the leximin objective we introduce an effective heuristic algorithm. We further derive impossibility results for other natural fairness objectives. We conclude with experimental results on real-world datasets that demonstrate the validity of our algorithms.
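The notion of color representation within clusters can be sketched as follows. This is an illustrative measure only, not one of the paper's objectives: it assumes each point has one cluster label and one color, and the helper names `color_proportions` and `max_deviation` are hypothetical.

```python
from collections import Counter

def color_proportions(labels, colors):
    """Map each cluster to the fraction of its points having each color."""
    counts = {}
    for cluster, color in zip(labels, colors):
        counts.setdefault(cluster, Counter())[color] += 1
    return {
        cluster: {color: n / sum(counter.values()) for color, n in counter.items()}
        for cluster, counter in counts.items()
    }

def max_deviation(labels, colors):
    """Largest gap between a color's share in a cluster and its overall share."""
    overall = Counter(colors)
    n = len(colors)
    props = color_proportions(labels, colors)
    return max(
        abs(props[cluster].get(color, 0.0) - overall[color] / n)
        for cluster in props
        for color in overall
    )

colors = ["red", "blue", "red", "blue"]
balanced = [0, 0, 1, 1]    # each cluster has one red and one blue point
segregated = [0, 1, 0, 1]  # each cluster contains a single color

print(max_deviation(balanced, colors))
print(max_deviation(segregated, colors))
```

A deviation of zero means every cluster mirrors the overall color mix; larger values indicate clusters that over- or under-represent some group.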
Full Text Available
Fairness, Semi-Supervised Learning, and More: A General Framework for Clustering with Stochastic Pairwise Constraints

Brubach, Brian; Chakrabarti, Darshan; Dickerson, John P; Srinivasan, Aravind; Tsepenekas, Leonidas (April 2021, Proceedings of the AAAI Conference on Artificial Intelligence)

Metric clustering is fundamental in areas ranging from Combinatorial Optimization and Data Mining, to Machine Learning and Operations Research. However, in a variety of situations we may have additional requirements or knowledge, distinct from the underlying metric, regarding which pairs of points should be clustered together. To capture and analyze such scenarios, we introduce a novel family of stochastic pairwise constraints, which we incorporate into several essential clustering objectives (radius/median/means). Moreover, we demonstrate that these constraints can succinctly model an intriguing collection of applications, including among others Individual Fairness in clustering and Must-link constraints in semi-supervised learning. Our main result consists of a general framework that yields approximation algorithms with provable guarantees for important clustering objectives, while at the same time producing solutions that respect the stochastic pairwise constraints. Furthermore, for certain objectives we devise improved results in the case of Must-link constraints, which are also the best possible from a theoretical perspective. Finally, we present experimental evidence that validates the effectiveness of our algorithms.
Full Text Available
Probabilistic Fair Clustering

Esmaeili, Seyed; Brubach, Brian; Tsepenekas, Leonidas; Dickerson, John (December 2020, Advances in neural information processing systems)
Full Text Available
Fair Clustering Under a Bounded Cost

Esmaeili, Seyed; Brubach, Brian; Srinivasan, Aravind; Dickerson, John P (January 2021, Advances in Neural Information Processing Systems 34)

Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the “price of fairness,” can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective which generalizes the group egalitarian objective. We derive fundamental lower bounds on the approximation of the utilitarian and egalitarian objectives and introduce algorithms with provable guarantees for them. For the leximin objective we introduce an effective heuristic algorithm. We further derive impossibility results for other natural fairness objectives. We conclude with experimental results on real-world datasets that demonstrate the validity of our algorithms.
Full Text Available
A Pairwise Fair and Community-preserving Approach to k-Center Clustering

Brubach, Brian; Chakrabarti, Darshan; Dickerson, John; Khuller, Samir; Srinivasan, Aravind; Tsepenekas, Leonidas (July 2020, International Conference on Machine Learning (ICML))
Full Text Available
Algorithms to Approximate Column-Sparse Packing Problems

Brubach, Brian; Sankararaman, Karthik; Xu, Pan; Srinivasan, Aravind (January 2020, ACM transactions on algorithms)

Full Text Available
Algorithms to Approximate Column-Sparse Packing Problems

Brubach, Brian; Sankararaman, Karthik A; Srinivasan, Aravind; Xu, Pan (January 2018, ACM-SIAM Symposium on Discrete Algorithms (SODA))

Column-sparse packing problems arise in several contexts in both deterministic and stochastic discrete optimization. We present two unifying ideas, (non-uniform) attenuation and multiple-chance algorithms, to obtain improved approximation algorithms for some well-known families of such problems. As three main examples, we attain the integrality gap, up to lower-order terms, for known LP relaxations for k-column sparse packing integer programs (Bansal et al., Theory of Computing, 2012) and stochastic k-set packing (Bansal et al., Algorithmica, 2012), and go “half the remaining distance” to optimal for a major integrality-gap conjecture of Furedi, Kahn and Seymour on hypergraph matching (Combinatorica, 1993).
Full Text Available